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TETRACYCLINE REPRESSOR— MEDIATED BINARY 

REGULATION SYSTEM FOR CONTROL OF 
GENE EXPRESSION IN TRANSGENIC ANIMALS 



1- INTRODUCTION 
The present: invention relates to a tetracycline 
repressor-mediated binary regulation system for the 
control of gene expression in transgenic animals. It 
is based, at least in part, on the discovery that, in 
a non-human transgenic animal that carries a first 
transgene under the control of a modified promoter 
comprising a tetR operator sequence and a second 
transgene encoding the tetR repressor protein, 
expression of the first transgene may be efficiently 
induced by administering tetracycline to the animal, 

2. BACKGROUND OF THE INVENTION 

2.1. CONTROL OF GENE EXPRESSION 
IN TRANSGENIC ANIMALS 

The production of transgenic animals for both 
experiment and agricultural purposes is now well known 
(Wilmut et al., 7 July 1988, New Scientist pp. 56-59). 
In research, transgenic animals are a powerful tool 
that have made significant contributions to our 
understanding of many aspects of biology and have 
contributed to the development of animal models for 
human diseases (Jaenisch, 1988, Science 240 : 1468- 
1474) . It is also clear that several livestock 
species can be made transgenic and these species 
promise to expand and revolutionize the method of 
production and diversity of pharmaceutical products 
available in the future, in addition to improving the 
agricultural qualities of the livestock species 
(Wilmut et al., supra ) . 

A critical, often neglected, aspect of developing 
transgenic animals is the process whereby expression 



WO 94/04672 PCT/US93/08230 

- 2 - 



of the newly introduced gene, referred to as the 
transgene, is controlled. This is an important 
process since stringent regulation of transgene 
5 expression is often important both for practical, 
regulatory and safety reasons and to maintain the 
health of the transgenic animal. In the past either 
"inducible" or "tissue specific" regulatory mechanisms 
have been used. Inducible regulation is defined 

xo herein as a method of gene regulation which allows for 
some form of outside manipulation of the onset and/or 
level of transgene expression. Tissue specific 
regulation is defined herein as a method for targeting 
transgene expression to particular tissues or organs. 

15 Inducible gene regulation may be achieved using 

relatively simple promoter systems such as the 
metal lothionein heat shock promoters, or by using 
promoters which are responsive to specific compounds 
such as the Mouse mammary tumor virus LTR which is 

2o responsive to glucocorticoid stimulation. More 

flexible, though more complex inducible regulation 
systems can be achieved through a "binary" gene 
approach which utilizes a transactivator gene product 
to control expression of a second gene of interest. 

25 Tissue specific gene regulation usually consists of 
simple single gene methods (Byrne et al., 1989, Proc. 
Natl. Acad. Sci. U.S.A. 86:5473-5477; Ornitz et al., 
1991, Proc. Natl. Acad. Sci. U.S.A. 88:698-702), 
although binary transactivator systems can also 

30 provide a high degree of tissue specificity. 

These current systems provide only a limited 
ability to control the time of transgene expression 
within individual animals. In this respect tissue 
specific promoter elements provide no method to 

35 control the onset of transgene activity, but function 
merely to target gene expression to defin d sites. 
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Simple inducible promoters such as metal loth ione in 
generally lack tissue specificity and usually have 
some aspect of endogenous basal expression which 
5 cannot be controlled. Thus even for the extensively 
used inducible metallothionsin promoter this approach 
at best only permits selection of the time at which a 
relative increase in transgene expression can be 
induced* 

10 Binary transactivation systems typically consist 

of two transgenic animals. One animal contains the 
gene of interest controlled by a promoter element that 
requires a specific transactivator gene product for 
expression. Thus, the gene of interest is not 

15 expressed in the absence of the transactivator. A 

second transgenic animal is then made which expresses 
the required transactivator in the desired tissue. By 
mating these two transgenic animals, offspring 
containing both the gene of interest and the 

20 transactivator transgene can be produced. Only in 
these doubly transgenic animals is the gene of 
interest expressed. Since expression of the gene of 
interest requires the transactivator, this binary 
approach dramatically reduces or eliminates any 

25 undesirable basal expression inherent in simple 

inducible systems. Additionally, if expression of the 
transactivator is targeted using a tissue specific 
promoter, then in the double transgenics, expression 
of the gene of interest is in effect targeted to the 

30 same specific tissue. Binary systems provide 
therefore a low resolution method of temporal 
regulation in as much as they allow the determination 
of which generation of animals will express the gene 
of interest. These systems provide little ability, 

35 however, to control the time and level of gene 

expression within an individual transgenic animal. 
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For many applications it is necessary to 
accurately control the time and pattern of transgene 
expression within an individual transgenic animal. 
5 For example, many attempts have been made to produce 
transgenic pigs which express increased levels of 
growth hormone (Vize et al., 1988, J. Cell Sci. 
£0:295-300;; Pinkert et al., 1990, Dom. Animal 
Endocrinol. 7:1-18). Elevated growth hormone levels 

l0 dramatically decrease the amount of body fat in pigs, 
and increase the animals overall feed efficiency. 
These effects would be beneficial, both to the 
consumer who could purchase a leaner, healthier 
product, and to the producer who can profit from 

xs having a more efficient animal. To date however, all 
attempts to increase the level of growth hormone 
through production of transgenic pigs have also 
produced serious pathological conditions which greatly 
reduce the health of the animals. These pathologies 

2o are the direct result of uncontrolled, constitutive 

expression of growth hormone, since many studies using 
exogenous hormone administration for short periods of 
time have not produced pathologies, while still 
benefiting feed efficiency and fat content. In this 

25 situation, a regulatory method to control onset and 
level of expression from a growth hormone transgene 
would be extremely useful. 



2.2. REPRES SQR— MED I ATED GENE CONTROL 
30 Transcriptional repressors are usually allosteric 

DNA binding proteins with at least two functional 
sites. One site on the protein is used to bind DNA. 
The DNA binding site binds to a defined DNA sequence 
which is known as the operator site. Operator sites 
35 usually consist of palindromic sequences of 12 or more 
base pairs. A gene which is regulated by a repressor 



WO 94/04672 



5 - 



PCT/US93/08230 



must have at least one operator site located within 
its promoter/ regulatory region. A second site on the 
repressor protein binds a specific ligand, usually a 
5 small macromolecule such as an amino acid, sugar, or 
antibiotic. Wheri the ligand is bound to the 
repressor, it causes a conformational shift such that 
the affinity of the repressor for the operator 
sequence is greatly reduced. For this reason, the 

10 ligand is frequently referred to as the "inducer", 

since it causes the repressor to disassociate from the 
operator, thereby eliminating the repressor's effect 
and allowing expression of the gene. 

Only the bacterial repressors Lacl, LexA and tetR 

15 have been shown to function in mammalian (Lacl and 

LexA) or plant (tetR) tissue culture cells. The first 
report of utilizing bacterial repressors in eukaryotes 
was from Brent and Ptashne who showed that LexA could 
function in yeast (1984, Nature 3^2:612-615). 

20 Subsequently, both LexA and Lacl have been shown to 

function in mammalian tissue cjulture systems (Smith et 
al., 1988, EMBO J. 2=3975-3981). Of these repressors 
Lacl has been most extensively studied. For Lacl 
repression, single or multiple operator sites have 

25 been positioned in three major locations: (i) between 
the transcription start site and the first codon of 
the mRN A; (ii) between the TATA-box sequence and the 
transcription start site; and (iii) between the TATA- 
box sequence and any more distal regulatory signal 

30 sequences. These studies reveal two predominant 
results. First, operators located in all three 
positions were effective in rendering the modified 
promoter subject to Lacl repression. Second, the 
presence of multiple operator sequences allowed 

35 greater 1 vels of repression than did single operator 
insertions. From these studies it appears the Lacl 
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repressor causes repression of mammalian promoters 
through two basic mechanisms* If the operators are 
located downstream of the transcription start site, 
5 Lacl appears to block expression by inhibiting mRNA 
elongation. That is to say, the Lacl repressor blocks 
the progress of RNA polymerase by steric interference. 
When operator sequences are located in other 
positions, Lacl seems to inhibit protein-protein 

10 interactions between the cellular factors normally 
involved in transcription initiation. 

Gatz and Quail (1988, Proc. Natl. Acad. Sci. 
U.S.A. 85:1394-1397) have demonstrated tetR function 
in a plant protoplast culture system. Plant 

15 protoplasts were transfected with a tetR gene 

expressed from a cauliflower mosaic virus (CAMV) 
promoter along with a CAT reporter gene, regulated by 
a modified CAMV promoter. In contrast to the results 
with Lacl, Gatz and Quail showed that tetR operators 

2o positioned between the transcription start site and 

the first codon of the CAT mRNA were not responsive to 
tetR repression. Therefore the tetR protein does not 
appear to be able to block the procession of RNA 
polymerase. Effective repression by tetR was only 

25 observed when the operator sequence was positioned 

such that the CAMV TATA-box element was flanked by the 
two 19bp palindromes of the tetR operator. With this 
modification, effective repression of the reporter 
gene, and induction with tetracycline could be 

30 achieved. This suggests that repression by tetR 

specifically inhibits the initiation of transcription, 
in this case apparently by blocking the binding of the 
TATA-box binding factors. 

Recently the tetR system has been shown to 

35 function in transgenic plants. Gatz et al. (1991, 
Mol. Gen. Genet. 222:229-237) have introduced their 
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operator sites flank the TATA-box into transgenic 
tobacco plants. Unexpectedly, this promoter, which 
5 exhibited very good regulation in tissue culture 
assays vas not very effective in regulating gene 
expression in transgenic plants. Instead they found 
that effective repression and induction in transgenic 
plants occurred when the operator sites were 
10 positioned just downstream of the normal transcription 
start site. 

3. SUMMARY OF THE INVENTION 
The present invention relates to a tetracycline 

15 repressor-mediated binary regulation system for the 
control of gene expression in non-human transgenic 
animals. It is based, at least in part, on the 
discovery that in transgenic mice carrying two 
transgenes, the first encoding bovine growth hormone 

20 (bGH) under the control of a PEPCK promoter modified 
to comprise the tetR operator sequence at the Nhel 
site, and the second encoding tetR repressor protein 
under the control of an unmodified PEPCK promoter, 
expression of bGH could be efficiently and selectively 

25 induced by administering tetracycline to the 
transgenic mice. 

In particular embodiments, the present invention 
provides for (i) animal promoter elements modified to 
comprise a tetR operator sequence; (ii) nucleic acid 

30 molecules comprising a gene of interest under the 

control of such a modified promoter; (iii) non-human 
transgenic animals that carry a transgene under the 
control of said modified promoter and/ or a transgene 
encoding the tetR repressor protein; and (iv) a method 

35 of selectively inducing the expression of a gene of 
interest in a non-human transgenic animal comprising 
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administering tetracycline to a non-human transgenic 
animal that carries a first transgene, which is the 
gene of interest under the control of a promoter 
5 modified to comprise a tetR operator sequence and a 
second transgene encoding the tetR repressor protein . 

The present invention offers the advantage that, 
in the absence of tetracycline, expression of the gene 
of interest occurs at only very low levels due to 

10 efficient repression by tetR, In preferred, non- 
limiting embodiments of the invention, repression by 
tetR is further enhanced by utilizing a synthetic tetR 
gene which is devoid of splice signals and has 
optimized codon usage for mammalian cells. 

15 Accordingly, the present invention allows tight 

control of gene expression in transgenic animals by 
withholding or administering tetracycline. 

4. DESCRIPTION OF THE FIGURES 

2o Figure l. A. Nucleotide sequence of tetR operator 

as it occurs in TnlO, and in the oligonucleotides 
used to produce the modified PEPCK promoter 
elements. Bold face lettering represent the OPl 
and OP2 tetR binding sites. The general purpose 

25 oligonucleotide is the sequence from p337. The 

flanking EcoRI and AccI restriction sites used to 
excise this operator sequence are indicated. 
Additional restriction sites present in the 
plasmid, but not indicated here, which can be 

30 used to excise the operator include PstI, BamHI, 

Spel, Sbal, NotI, EagI, SacII, BstXI, and Sad on 
the 5 1 side and Xhol, Apal and Kpnl on the 3 9 
side. The sequence of the PEPCK-TATA box 
operator is also indicated (see methods) . 

35 Figure 1. B. Nucleotide sequence of the 337 
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operator. Lower case letters correspond to 
poly linker sequence. The 5' EcoRI and 3 1 AccI 
restriction sites used for producing the modified 
5 PEPCK promoters (Pck_A and Pck-N) are indicated. 

The 10 base pair linker beween OP1 and OP2 is 
underlined. Additional poly linker restriction 
sites available in p337 include PstI, BamHI, 
Spel, Xbal, NotI, EagI, SacII # BstXI,and SacI on 
10 the 5 % side and Xhol, Apal and Kpnl on the 3' 

side. 

Figure 2. A representation of the three modified 

PEPCK promoter elements. Construct 251 has the 
337 operator sequence integrated in the AccI site 

15 of PEPCK , just 5* of the TATA-box control 

element. Construct 252 has the 337 operator 
sequence incorporated into the Nhel site of 
PEPCK, just 3 1 of the TATA-box element. 
Construct 261 incorporates the TATA-specif ic 

20 operator sequence which is integrated between the 

5* AccI site and the 3 1 Nhel sites. 
Figure 3. Structure of the modified PEPCK controlled 
bovine growth hormone genes. The Pck_AbGH and 
PckJJbGH genes differ only in the site of 

25 operator insertion. For Pck_AbGH the operator is 

inserted at the AccI site 5' of the PEPCK TATA- 
box element. For Pck_NbGH the operator is 
inserted into the Nhel site 3 • of the TATA-box 
element (pPCK_NbGH has been deposited with the 

30 ATCC and assigned accession No: ) . In the 

Pck_TbGH gene, a TATA-box specific 
oligonucleotide was used, and this sequence was 
inserted between both the AccI and Nhel sites. 
A. Indicated the probe used for SI hybridization. 

35 Figure 4 . SI Nucleas protection assay to map the 5 1 
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start site of bGH from the Pck_N promoter. Total 
liver RNA (10/ig) was hybridized to a 280 bp 5* 
labelled probe from the Pck_NbGH gene in 40mM 
5 PIPES (Ph6.4), IMm £DTA f 400mM NaCl, 80% 

formamide at 55° overnight. The probe spanned 
from the Hinf I site in the 5 • untranslated leader 
sequence of bGH to the PvuII site 5 1 of the TATA- 
17 box. The probe includes the tet-operator 

10 sequence of Pck_N (see Figure 3) . After 

hybridization 3 00 /xl of ice cold digestion buffer 
(280mM NacL, 50Mm SODIUM ACETATE (Ph4.5), 4 . 5Mm 
ZnS0 4 / 20/xg/ml carrier DNA and 500 units SI 
nuclease) was added and incubated at 37° for 30 

15 minutes. The reaction as stopped by adding 80/xl 

of Stop Buffer (4M Ammonium acetate, 50mM EDTA 
and 50/xg/ml tRNA) , extracted with 
phenol/chloroform, precipitated with ethanol and 
analyzed on a 6% sequencing gel. The arrow 

20 indicates the protected fragment. Initiation of 

bGH mRNA from the modified Pck_N promoter occurs 
approximately 20 bp 3 1 of the TATA— box . This 
initiation site places the start of the message 
just prior to the first tetR binding site. This 

25 result indicates that the bGH mRNA starts from a 

single cap site, and suggests that tetR 
repression is due to a block in transcription 
initiation. Furthermore, unrepressed bGH 
expression appears to be due to limited tetR 

3 o expression . 

Figure 5. Nucleotide sequence of the tetR repressor 

protein gene. 
Figure 6. Alterative, non limiting promoters of 

interest. Asterisks indicate sites at which tetR 

35 operator sequence may be inserted. 

Figure 7 . Northern blot analysis of bGH mRNA in liver 
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of Fl generation animals. 

Figure 8. Northern blot analysis of bGH mRNA 
expression in four transgenic lines. 

Figure 9A. Tissue specificity of bGH expression in 
Liins 10—2 in the nrpsence of 50 ucr /ml 
tetracycline. Northern blot analysis of bGH 
induction in a variety of tissues. Only the 
liver and kidney show significant expression. 

Figure 9B. Tetracycline induction of bGH in Line 10-2 
Both liver and kidney, which are the only sites 
for bGH expression in Figure 9A, also show 
tetracycline dependent bGH expression. 

Figure 10. 345 Repressor Construct. 

Figure 11. Induction of bGH expression in Construct 
345 Offspring. Northern blot analysis of liver 
RNA from Fl animals containing the 345 construct 
Only animals from line 14 exhibit tetracycline 
dependent bGH expression. 

Figure 12. Expression and alternative processing of 
tetR transgene. A RNase protection probe which 
extends from the Nrul site of tetR 3 ■ to the end 
of the gene was used. This probe includes only 
tetR coding sequences and should give a fully 
protected fragment of approximately 400 base 
pairs. A protected fragment of approximately 
220-260 base pairs is observed, which is far 
smaller then predicted. 

Figure 13. 5" Structure of tetR mRNA. Liver RNA was 
treated with reverse transcriptase and amplified 
by PCR. The RNA was amplified using two 
different pairs of primers. The first primer 
pair (TZ-1 and TZ-4) should produce a 619 base 
pair product. The second primer pair (TZ03 and 
TZ04) should produce a 498 base pair product. 
The sequence of the primers ar : 
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TZ-1 : 5 1 CCGCATATGATCAATTCAAGGCCGAATAAG3 1 
TZ-3 : 5 1 CTTTAGCGACTTGATGCTCTTGATCTTCCA3 • 
TZ-4 : 5 • AATTCGCCAGCCATGCCAAAAAAGAAGAGG3 1 
5 The TZ-4 primer is common to both primer pairs 

and is the 5 1 primer which encompasses the start 
codon of the tetR and mRNA. Primer TZ-1 and TZ-3 
are two different 3 1 primers both of which are in 
the tetR coding region. When amplified, these 

10 primer pairs produced smaller then expected 

products (approx. 215bp vs. 619bp for TZ-4 and 
TZ-1, and approx. 94bp vs. 498bp for TZ-4 and 
TZ-3) . The products of this reaction were cloned 
and sequenced. Sequencing revealed the presence 

15 of an unexpected intron which spanned from near 

the Xbal site at the start of tetR to a splice 
acceptor just 8 base pairs 5 • of the TZ-3 primer. 
Figure 14. Composition analysis of Wild Type TnlO 
tetR gene. The TnlO tetR coding sequence was 

20 analyzed on a desktop computer using Mac Vector 

software. The figure shows a diagram of the tetR 
coding region with the plus strand splice doner 
(D) and splice acceptor (A) signal sequences 
indicated. For reference the location of the 

25 Xbal restriction is also indicated. The first 

graph depicts the percentage of G and C bases in 
the coding region of tetR. There are several 
domains of very low GC content. The bottom graph 
is an analysis of codon bias. The dark line is a 

30 comparison of the tetR codon usage to a mouse 

codon bias table. Values lower than 1.0 are 
indicative of sequences which may translate 
poorly. For reference, a comparison of tetR to a 
Tobacco codon bias table is included (light 

35 line) • In transgenic tobacco, the tetR 

regulation system functions very efficiently, 



suggesting that for this gene, codon bias may be 
an important factor for efficient expression. 

Figure 15. Synthetic tetR Component Sequences. The 
components of the synthetic tetR gene were 
synthesized by Midland Laboratories as four 
overlapping double stranded DNA cassettes. The 
sequence of these cassettes are shown. Each 
cassette was blunt cloned into the Hinc2 site of 
pUC19 and sequenced to verify authenticity. The 
resulting plasmids pLTl, pLT2, pLT3 and pLT5 can 
be used as the source material to assemble the 
entire synthetic tetR coding sequence since each 
contains an overlapping unique restriction site 
(bold face) through which they can be joined. 

Figure 16. Sequence of Synthetic tetR gene. 

Figure 17. Composition analysis of synthetic tetR. 
These graphs were produced using the same 
software described in Figure 15. The figure 
depicts the structure of the synthetic tetR gene, 
now devoid of splice donor signal sequences, with 
only a single splice acceptor signal remaining 
(A) . This is not the splice acceptor which was 
active in the 345 construct. The percentage of G 
and C bases has been significantly improved, 
while the frequency of CpG base pairs has been 
kept to a minimum. A CpG base pair is frequently 
the site for DNA methylation, which can 
negatively effect the expression of a gene. The 
codon bias of the synthetic tetR gene is also 
vastly improved. The graph depicts the results 
when the synthetic tetR coding sequence is 
compared to the same mouse codon bias table used 
previously. 
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5. DETAILED DESCRIPTION OF THE INVENTION 
For purposes of clarity of description, and not 
by way of limitation , the detailed description of the 
invention is divided into the following subsections: 

(i) the tetR operator; 

(ii) modified promoters containing the tetR 
operator ; and 

(iii) utility of the invention. 

5.1. THE TETR OPERATOR 
In order to practice the instant invention, the 
tetR operator sequence is inserted into a suitable 
animal promoter sequence in order to render that 
promoter subject to control by tetR repressor protein. 
A diagram of the tetR operator sequence is depicted in 
Figure 1. 

It may be convenient to clone the tetR operator 
into a vector, such as a plasmid or a phage, to 
facilitate its propagation. Cloned operator sequence 
may then be rendered available for insertion into a 
promoter of interest, as set forth in Section 5.2., 
infra. 

In a particular, nonlimiting embodiment of the 
invention, tetR operator sequence may be cloned as 
follows: Four oligonucleotides, which when annealed 
produce the two 19bp OP1 and OP2 palindromic sequences 
of the tetR operator may be synthesized; the sequences 
of said oligonucleotides are as follows: 
X- 1 . 5 1 ACTCTATCATTGATAGAGT3 1 
X-2 . 5 • ACTCTATCAATGATAGAGT3 1 
X-3 . 5 » TCCCTATCAGTGATAGAGA3 1 
X-4 . 5 • TCTCTATCACTGATAGGGA3 ■ 

Oligonucleotides X-l and X-2 are complementary and, 
when annealed, form the OP1 operator. Similarly, 
oligonucleotid s X-3 and X-4, when annealed, produce 
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the OP2 operator site. The OP1 oligonucleotides may 
then be directly cloned into the EcoRV site of the 
Bluescript (Stratagene) poly linker to form plasmid X. 
5 OP2 oligonucleotides may then be cloned into a Mung 
bean nuclease blunted Cial site of plasiuid X to form 
plasmid Y. The resulting tetR operator may then be 
propagated and then excised from plasmid Y as an 
EcoRI, AccI fragment which may be end-filled with T4 
10 polymerase and gel purified. 

It is preferable that the separation between OP1 
and OP2 is about 10-11 bp. 

Analogous methods may be used to insert the tetR 
operator site into other suitable vectors. 

15 

5.2. MODIFIED PROMOTERS CONTAINING 
THE tetR OPERATOR 

According to the invention, the tetR operator may 
be inserted into a suitable animal promoter so as to 

2o render that promoter subject to repression by tetR 
repressor protein. Any animal promoter maybe used; 
strategies for promoter selection are set forth in 
Section 5.3., infra . 

In preferred embodiments of the invention, the 

25 tetR operator sequence is positioned 3 ■ to the TATA- 
box sequence. A nonlimiting list of promoters which 
may be used according to the invention is set forth in 
Figure 6, together with the proximal portion of the 
promoter in the vicinity of the TATA-box, which is 

30 underlined. 

In a specific, nonlimiting embodiment of the 
invention, the tetR operator site may be inserted into 
the Nhel site of the PEPCK promoter (Wynshaw-Boris et 
al., 1984, J. Biol. Chem. 259:12161-12169). A diagram 

35 of the PEPCK promoter containing the tetR operator 
sequence of the Nhel site is presented in Figure 2 • 
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For insertion of the operator sequence, the PEPCK 
promoter may be cut with Nhel and end-filled with T4 
polymerase; tetR operator, prepared as set forth in 
5 Section 5.1., supra , may then be blunt-ligated into 
place. 

5.3. UTILITY OF THE INVENTION 
5.3.1. STRATEGY 
10 The strategy of the invention is to prepare a 

non-human transgenic animal that comprises two 
transgenes. The first transgene, termed "A," is a 
gene of interest, the expression of which is desirably 
controlled. Virtually any gene of interest may be 
15 used, including, but not limited to, growth hormone, 
hemoglobin, low density lipoprotein receptor, insulin, 
genes set forth in Table I, etc. 

TABLE 1 
Other Genes Of Interest 
Gene Disease/Affect 
ADA Adenosine deaminase Immuno-def iciency 

TNF Tumor necrosis factor Anti-cancer 
IL-2 Interleukin-2 Anti-cancer 
LDL low density hypercholesterolemia 
Factor IX hemophelia 
Factor VIII hemophelia 
/S-glucosidase Gauchers disease 

CFTR Cystic fibrosis Cystic fibrosis 

transmembrane regulator 
HPRT Hypoxanthine-guanine Lesch-Nyhan syndrome 

phosphor ibosyltransf erase 
UDP-glucuronyl transferase Crigler-Naj jar syndrome 
Growth Hormone receptor Growth 
Insulin-like growth factor Growth 
Growth hormone releasing Growth 
factor 



The expression of gene "A" is under the 
transcriptional control of promoter "B" . Promoter B 
comprises a tetR operator sequence, as discussed 
supra . Promoter B desirably defines the time and 
tissue window in which the transgsns may be induced; 
for example, promoter A may be a tissue specific 
promoter such as the PEPCK promoter (which is 
expressed selectively in liver and becomes active 
shortly prior to birth) . The second transgene encodes 
the tetR repressor, the sequence of which is set forth 
in Figure 5, 

Analysis of the TnlO tetR coding sequence 
indicates that the codon usage for this gene is poorly 
suited for expression in mammalian cells (FIG. 15) . 
To optimize tetR expression in mammalian cells a new 
tetR repressor gene was designed ( See , Section 7, 
infra ) , which may be utilized in alternative 
embodiments of the invention. The synthetic tetR gene 
(syn-tetR) is designed to encode exactly the same 
protein product as the bacterial TnlO tetR gene but 
optimizes codon usage for mammalian cells. The 
percentage of G and C bases has been significantly 
improved, while the frequency of CpG base pairs has 
been minimized. A CpG base pair is frequently the 
site for DNA methylation which can negatively affect 
the expression of a gene. In addition, the syn-tetR 
gene is devoid of any splice signals, decreasing the 
likelihood of aberrant splicing of the RNA which may 
result in production of a non-functional message. The 
sequence of the synthetic tetR gene is depicted in 
Figure 16. Plasmids comprising these sequences may be 
constructed using plasmids pLT-1, pLT-2, pLT-3 and 
pLT-5 (deposited with the American Type, Culture 
Collection (ATCC) and assigned accession numbers 
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$ , , and , as described in 

Section 7, infra. 

In further embodiments, the present invention 
5 provides for additional synthetic tetR genes from 
which one or more splice sites have been deleted or 
for which codon usage has been further optimized. 

The present invention covers synthetic tetR genes 
having the sequence set forth in Figure 16 and for 

Xo functionally equivalent variants of that sequence. 

In specific, non-limiting embodiments of the 
invention, a nuclear localization signal may be added 
to a natural or synthetic tetR gene to facilitate its 
expression ( See , Section 7, infra ) . 

15 Expression of tetR is controlled by promoter . 

While it is preferable that promoter C be the same as 
promoter B except that promoter C does not contain a 
tetR operator sequence, any promoter which provides 
expression of tetR so as to repress expression of gene 

2o "A" during the period when it is desirable to repress 
expression of W A M may be used. 

For example, and not by way of limitation, a 
transgenic animal may be produced which carries a 
first transgene which is bovine growth hormone under 

25 control of a PEPCK promoter modified to contain a 

tetR operator sequence at the Nhel site and a second 
transgene which is tetR repressor protein under the 
control of an unmodified PEPGK promoter; see Section 
6, infra. The pPCK_NbGH construct has been deposited 

30 with the ATCC and assigned accession number . 



5.3.2. TRANSGENIC ANIMALS OF 
THE INVENTION 

The binary repressor system of the invention may 

be used to control gene expression in any non-human 

transgenic animal, including, but not limited to, 

transgenic mice, pigs, goats, cows, rabbits, sheep, 
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etc. The present invention provides for such non- 
human transgenic animals earring as transgenes nucleic 
acid constructs described herein, including natural or 
synthetic tetR repressor proteins and operator 

Transgenes may be introduced by microinjection, 
transf ection, transduction, electroporation, cell gun, 
embryonic stem cell fusion, or any other method known 
in the art. The transgenes of the invention may be 
co-introduced into a single animal or may be 
introduced into two individual animals that are 
subsequently mated to produce doubly transgenic 
offspring. 

For example, for the production of transgenic 
mice, the following general protocol may be used. 
Male and female mice are mated at midnight. Twelve 
hours later, the female may be sacrificed and the 
fertilized eggs may be removed from the uterine tubes. 
Foreign DNA may then be microinjected (100-1000 
molecules per egg) into a pronucleus. Shortly 
thereafter, fusion of the pronuclei (a pronucleus or 
the male pronucleus) occurs, and, in some cases, 
foreign DNA inserts into (usually) one chromosome of 
the fertilized egg or zygote. The zygote may then be 
implanted into a pseudo-pregnant female mouse 
(previously mated with a vasectomized male) where the 
embryo develops for the full gestation period of 20-21 
days. The surrogate mother then delivers the mice and 
by four weeks transgenic pups may be weaned from the 
mother . 

According to another embodiment of the invention, 
a transgenic pig may be produced, briefly, as follows. 
Estrus may be synchronized in sexually mature gilts 
(>7 months of age) by feeding an orally active 
progestogen (e.g. allyl trenbolone, AT: 15mg/gilt/day) 



gilts may be given an intramuscular injection of 
prostaglandin (Lutalyse: 10mg/ injection) at 0800 
5 and 1600 hours. Twenty-four hours after the last day 
of AT consumption all donor gilts may be administered 
a single intramuscular injection of pregnant mare 
serum gonadotrophin (1500 U) . Human chorionic 
gonadotrophin (750 IU) may be administered to all 

10 donors at 80 hours after pregnant mare serum 
gonadotrophin . 

Following AT withdrawal, donor and recipient 
gilts may be checked twice daily for signs of estrus 
using a mature boar. Donors which exhibited estrus 

15 within 36 hours following human chorionic 

gonadotrophin administration may be bred at 12 and 24 
hours after the onset of estrus using artificial and 
natural (respectively) insemination. 

Between 59 and 66 hours after the administration 

20 of HCG one- and two-cell ova may be surgically 
recovered from bred donors using the following 
procedure. General anesthesia may be induced by 
administering 0.5 mg of acepromazine/kg of bodyweight 
and 1.3 mg of ketamine/kg via a peripheral ear vein. 

25 Following anesthetization, the reproductive tract may 
be exteriorized following a mid-ventral laparotomy. A 
drawn glass cannula (O.D. 5 mm, length 8 cm) may be 
inserted into the ostium of the oviduct and anchored 
to the infundibulum using a single silk (2-0) suture. 

30 Ova may then be flushed in retrograde fashion by 

inserting a 20g needle into the lumen of the oviduct 2 
cm anterior to the uterotubal junction. Sterile 
Dulbecco's phosphate buffered saline (PBS) 
supplemented with 0.4% bovine serum albumin (BSA) may 

35 be infused into the oviduct and flushed toward the 
glass cannula. The medium may b collected into 
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sterile 17 x 100 mm polystyrene tubes. Flushings may 
be transferr d to 10 x 60 mm petri dishes and searched 
at a lower power (5 Ox) using a Wild M3 
5 ster eomicroscope • All one- and two- cell ova may be 
washed twice in Brinster's Modified Ova Culture -3 
medium (BMOC -3) supplemented with 1.5% BSA and 
transferred to 50 fil drops of BMOC-3 medium under oil. 
Ova may be stored at 38 °C under a 90% N z , 5% O z , 5% Co 2 

10 atmosphere until microinjection is performed. One and 
two-cell ova may be placed in an Eppendorf tube (15 
ova per tube) containing 1 ml HEPES medium 
supplemented wit 1.5% BSA and centrifuged for 6 
minutes at 14,000g in order to visualize pronuclei in 

15 one-cell and nuclei in two-cell ova. Ova may then be 
transferred to a 5-10/xl drop of HEPES medium under oil 
on a depression slide. Microinjection may be 
performed using a Labor lux microscope with Nomarski 
optics and two Leitz micromanipulators. 10-1700 

20 molecules of construct DNA (linearized at a 

concentration of about Ing/jil of Tris-EDTA buffer) may 
be injected into one pronucleus in one-cell ova or 
both nuclei in two-cell ova. Microinjected ova may be 
returned to microdrops of BMOC-3 medium under oil and 

25 maintained at 38 °C under a 90% N 2 , 5% C0 2 , 5% 0 2 
atmosphere prior to their transfer to suitable 
recipients. Ova may preferably be transferred within 
10 hours of recovery. Only recipients which exhibit 
estrus on the same day or 24 hours later than the 

30 donors may preferably be utilized for embryo transfer. 
Recipients may be anesthetized as described supra . 
Following exteriorization of one oviduct, at least 30 
injected one- and/ or two-cell ova and 4-6 control ova 
may be transferred in the following manner. The 

35 tubing from a 21g x 3/4 butterfly infusion set may be 
connected to a lec syringe. The ova and one to two 
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mis of BMOC-3 medium may be aspirated into the tubing. 
The tubing may then be f d through the ostium of the 
oviduct until the tip reaches the lower third or 
5 isthmus of the oviduct. The ova may be subs quently 
expelled as the tubing is slowly withdrawn. The 
exposed portion of the reproductive tract may be 
bathed in a sterile 10% glycerol - 0.9% saline 
solution and returned to the body cavity. The 

20 connective tissue encompassing the linea alba, the 
fat, and the skin may be sutured as three separate 
layers. An uninterrupted Halstead stitch may be used 
to close the linea alba. The fat and skin may be 
closed using a simple continuous and mattress stitch, 

15 respectively. A topical antibacterial agent (e.g. 

Furazolidone) may then be administered to the incision 
area. Recipients may be penned in groups of about 
four and fed 1.8 kg of a standard 16% crude protein 
corn-soybean pelleted ration. Beginning on day 18 

20 ( da Y 0 = onset of estrus) , all recipients may be 

checked daily for signs of estrus using a mature boar. 
On day 35, pregnancy detection may be performed using 
ultrasound. On day 107 of gestation recipients may be 
transferred to the farrowing suite. In order to 

25 ensure attendance at farrowing time, farrowing may be 
induced by the administration of prostaglandin (10 
mg/injection) at 0800 and 1400 hours on day 112 of 
gestation. In all cases, recipients may be expected 
to farrow with 34 hours following PGF 2a 

3 o administration . 

As used herein, the term "transgenic animal" 
refers to animals that carry a transgene in at least 
some of their somatic cells, and preferably in at 
least some of their germ cells. 



35 



5.3.3- INDUCTION 

Induction of expression of the gene of interest 
in transgenic animals of the invention may be achieved 
by administering, to the animal, a compound that binds 
to tetR so that tetR repressor function is inhibited. 
Examples of such compounds include tetracycline and 
tetracycline-like compounds, including, but not 
limited to, apicycline, chlortetracycline, 
clomocycline, demeclocyline, guamecycline, lymecycline, 
meclocycline , methacycline , minocycline , 
oxytetracycl ine , penimepicycl ine , pipacycline , 
rolitetracycline, sancycline, and senociclin. 

Administration of the inducer can be through 
direct injection, water, feed, aerosol, or topical 
application. The choice of method will depend on the 
promoters used and the specific application of the 
transgenic animals. For example, injection, water and 
feed would provide inducer to all of the animals 
tissues. In our case, administration through water or 
feed would be the preferred method to control growth 
hormone expression in transgenic pigs. Aerosol spray 
could be used to attain high antibiotic concentrations 
in the lung. This may be appropriate for example in a 
cystic fibrosis or emphysema model. Topical 
application to the skin is also possible and could be 
used in models of acne, hair loss, wound healing or 
viral infection. 

Induction of the gene of interest is accomplished 
by administering an effective amount of inducer, as 
described above. An effective amount of inducer may 
be construed to mean that amount which increases 
expression of the gene of interest by at least about 
50 percent. As the LD 50 for tetracycline HCl in rats 
is about 6643 mg/kg and the therapeutic dose is 
between about 25-50 mg/kg, an effective dose of 
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tetracyclin , as inducer, is between about: 5-50 mg/kg 
and preferably betw en ab ut 5-15 mg/kg. 

5 6. EXAMPLE: TETRACYCLINE REPRESSOR-MEDIATED 
BINARY REGULATION SYSTEM FOR CONTROL OF 
BOVINE GROWTH HORMONE EXPRESSION IN 
TRANSGENIC MICE 

6.1. MATERIALS AND METHODS 

6.1.1. CONSTRUCTION OF PLASMIDS 

10 Plasmid p3d7 contains a functional tetR operator 

site cloned within a Bluescript (Stratagene) 

polylinker. This plasmid is useful for propagating 

the operator sequence, and as a source of operator 

sites for insertion into the PEPCK promoter or any 

15 other promoter element. The pdd7 plasmid was made as 

follows. Four oligonucleotides, which when annealed 

produce the two 19bp OP1 and OP2 palindromic sequences 

of the tetR operator were synthesized. The sequences 

of each oligonucleotide is listed below. 

20 X-1.5 1 ACTCTATCATTGATAGAGT 3' 

X-2.5 1 ACTCTATCAATGATAGAGT 3» 

X-3.5' TCCCTATCAGTGATAGAGA 3' 

X-4.5 1 TCTCTATCACTGATAGGGA 3* 

Oligonucleotides X-l and X-2 are complementary and 

25 when annealed form the OP1 operator. Similarly 

oligonucleotides X-3 and X-4 produce the OP2 operator 

site. The OPl oligonucleotides were directly cloned 

into the EcoRV site of the Bluescript polylinker. The 

resulting plasmid pSOPI was sequenced to verify the 

30 integrity of the insert. OP2 oligonucleotides were 

subsequently cloned into a Mung bean nuclease blunted 

Clal site of pSOPI to produce pdd7. Due to a cloning 

artifact produced by the Mung bean nuclease, the 

operator in pdd7 consists of the two 19bp OPl and OP2 

35 sequences separated by linker of only 10 base pairs. 

This difference does not effect tetR binding. The 



25 - 



sequence of the pddl operator sit is shown in Figure 
IB, The 55 base pair t tR operator was excised from 
pdd7 as an EcoRl, AccI fragment, end filled with T4 
polymerase, and gel purified. This fragment was 
subsequently used to produce the modified PEPCK 
promoters Pck_N and Pck_A. 

Plasmids Pck_A and Pck_N were produced by 
inserting the 55bp tetR operator into the unique AccI 
and Nhel sites (respectively) of the PEPCK promoter 
(pPCK_NbGH has been deposited with ATTC and assigned 
accession No: ) . For both plasmids the promoter was 
cut with the appropriate restriction enzyme, end 
filled with T4 polymerase and the tetR operator blunt 
ligated into place. A third modified PEPCK promoter, 
Pck_T was produced in which the OP1 and OP2 operator 
sequences were positioned to flank the PEPCK TATA— box 
element. To produce Pck_T a new oligonucleotide 
( 5 • ACTCTATCATTGATAGAGTTACTAT 

TTAAATCCCTATCAGTGATAGAGA3 ■ ) was produced. This 
oligonucleotide was kinased with T4 polynucleotide 
kinase and annealed to kinased X-2 and X-4 which are 
complementary to the first and last 19bp. The 
complete double stranded 49bp operator was produced by 
filling in the llbp linker region, which includes the 
PEPCK TATA-box element, with Klenow. The final 
product was then blunt cloned into an AccI, Nhel cut 
PEPCK promoter. All three modified promoters were 
sequenced to verify the inserts. Figure 2 depicts the 
structure of these promoters. 

6.1.2. REPRESSOR CONSTRUCT 
Plasmid pBISOl contains a 701 bp Hindi fragment 
from E. coli TnlO, cloned into the Hindi site of 
pUC8. The Hindi insert contains the entire tetR 
coding sequence along with 21bp of 5 1 and 55bp of 3 1 
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untranslated DNA. This insert was excised from the 
parent plasmid and subcloned into a plasmid with a 
more suitable polylinker to produce pSTET7 . To this 
5 plasmid a 870bp Xhol, BamHI fragment derived from pMSG 
(Pharmacia) , containing the SV40 small-T intron and 
polyadenylation signal sequences was inserted at the 
Hindu site 3 1 of the tetR coding region to produce 
pSTetRSv. Finally an unmodified 610bp PEPCK promoter 

10 was inserted at the EcoRl site of pSTETRSv to produce 
pPck_tetRSv. The PEPCK promoter is identical to the 
promoter used to produce pPck_A, pPck__N, and pPck_T 
except that it does not contain a tetR operator site. 
This PEPCK promoter has been previously used in 

15 transgenic animals and is known to target gene 
expression specifically to the liver. 

6. 1.3. GROWTH HORMONE GENES 
Plasmid pGH-SAF107 contains a 2.2kb BamHI, EcoRI 

2o genomic fragment of the bovine growth hormone (bGH) 
gene, blunt ligated into an EcoRV site. To this 
vector each of the modified PEPCK promoters was added 
by blunt ligating the promoter into the BamHI site of 
pGH-SAF107. The structure of the resulting plasmids 

25 is depicted in Figure 3. Plasmid pPCK_NbGH was 

deposited with the ATCC and assigned accession number 

. For production of transgenic animals, 

each of the PEPCK— bGH genes was excised from the 
vector using Xhol and Sacl, gel fractionated and 

30 purified using an Elutip column. 

6.1.4. TRANSGENIC MICE 
Transgenic mice were made which contain both the 
Pck_tetRSv gene and one of the modified PEPCK 
35 promoters controlling bGH. Table 2 lists the number 
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of eggs injected, offspring produced and number of 



transgenics derived 


for ach construct. 








TABLE 2 










Eggs 


Eggs 


Live 




Construct 


injected 


transferred 


Born 


Transgenic 


Pck AbGH + Pck tetRSv 


233 


194 


40 


14 (0.35) 


(251) 








Pck— NbGH + Pck tetRSv 


268 


208 


30 


9(0.3) 


(252) 










Pck TbGH + Pck-tetRSv 


227 


197 


25 


5(0.2) 


(261) 










6.2. 


RESULTS AND 


DISCUSSION 







Once the transgenic founder animals were 
identified, they were weighed each week. Table 3 
lists the mean weights of each group of transgenic 
animal at 11 weeks of age. 

TABLE 3 



Construct 


Sex 


Height 


Pck_ 


AbGH + Pck_tetRSv(9) 


male 


36 


.122 (12.251) 


Pck_ 


AbGH + Pck_tetRSv(4) 


female 


29 


.125(7.861) 


Pck_ 


_NbGH + Pck_tetRSv(5) 


male 


34 


.840(14.745) 


Pck 


NbGH + Pck_tetRSv(4) 


female 


28 


.125(10.958) 


Pck 


TbGH + Pck_tetRSv(3) 


male 


36 


.267 (11.402) 


Pck 


TbGH + Pck_tetRSv(2) 


female 


27 


.300(5.798) 


NON- 


-TRANSGENIC (6) 


male 


29 


.583 (2.395) 


NON- 


-TRANSGENIC (6) 


female 


23 


.117(1.863) 



As expected for each co-injection, large animals, 
obviously expressing elevated levels of bGH, were 
observed as were animals of normal stature. 
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At 10 weeks of age, a sampling of transgenic 
female founders containing the A+T and N+T were tested 
for induction of bGH in the serum using a radio-immune 
5 assay, after a single IP injection of 60 mg/kg 

tetracycline-HCl. The purpose of this experiment was 
simply to determine which if either of these two 
modified promoters was responsive to repression by 



tetR. The 


results are 


summarized in Table 


4. 








TABLE 4 








Construct 


Animal 


Weight 


Basal 


12 hours 


36 hours 


249 


2-5 female 


21.1 


0.00 


0.00 


0.00 


250 


6-6 female 


42.9 


4.6+0.033 


3.4+0.062 


4. 9+0;. 072 


251 


6-6 female 


19.3 


0.00 


0.00 


0.00 


251 


10-5 female 


25-1 


0.20+0.008 


0.19+0.001 


0.21+0.038 


252 


5-2 female 


38.7 


0.59+0.107 


0.64+0.044 


1.12+0.207 


252 


5-3 female 


20.0 


0.00 


0.00 


0.00 


252 


10-2 


19.2 


0.00 


0.00 


0.00 



No induction of bGH was observed in animals that lack 
the Pck_tetRSV gene (construct 2 50) or in animals with 

25 both the Pck_AbGH + Pck-tetRSv genes (construct 251) . 
An approximate two fold increase in serum bGH levels 
was however detected in the 5-2 female which contains 
the Pck-NbGH + Pck_tetRSV genes. The remainder of the 
animals had undetectable levels of bGH expression, due 

30 in part to the relatively low sensitivity of this 
assay. m For example the 10-2 female (construct 252) 
shows no detectable bGH in the serum, but subsequent 
experiments on her offspring indicate that this line 
of animals does express bGH mRNA in a tetracycline 

35 dependent manner. This initial data, suggested that 
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th Pck_N promoter was being r gulated by tetR at 
1 ast to a limited extent. 

To further characterize the mice, improve the 
5 sensitivity of the assay and to test the 

responsiveness of the Fck_T promoter, offspring of 
founder mice from each co-injection were produced. 
The transgenic progeny were then raised in the 
presence or absence of tetracycline medicated water 

10 (500/xg/ml) for 4 weeks, prior to analysis of bGH mRNA 
expression levels in the liver, the predominant site 
of PEPCK expression. Northern blot hybridization 
analysis of these animals (Figure 7) demonstrated 
again, that animals with the Pck_NbGH gene were 

25 responsive to repression by tetR and that the other 
two modified promoters exhibited no signs of tetR 
dependent regulation. 

We attempted to breed all of the remaining 
founders containing the Pck-NgGH + Pck_tetRSv genes to 

20 analyze their offspring in a similar manner (Figure 

8) . Of the 5 founders which produced offspring, 2 did 
not express bGH under any conditions, and from the 
remaining 3 one segregated two different integration 
sites allowing us to establish a total of 4 lines. 

25 All 4 lines exhibited tetracycline dependent bGH 

expression as assayed by Northern blot hybridization. 
The efficiency of tetR repression appeared to be 
inversely correlated with the level of expression. 
For example 9-5 animals have the highest level of bGH 

30 expression, show an obvious increase in body size, and 
exhibit only marginal tetR repression. In contrast 9- 
4Lc and 10-2 animals exhibit lower levels of 
tetracycline induced bGH expression, are of normal 
stature and appear to be efficiently regulated by 

35 t tR. 
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An SI nucl ase protection assay was performed to 
identify the start site of transcription of bGH mRNA. 
As shown in Figure 4, there was only one start site 
identified regardless of the presence or absence of 
tetR repressor binding. This start site was located 
approximately 20 bp downstream from the TATA-box. At 
this location, the message is initiating within the 
337 operator sequence, just 3 or 4 base pairs 5* of 
the first tetR binding site. 

7. EXAMPLE: OPTIMIZATION OF tetR CODING SEQUENCE 
The use of the wild type TnlO tetR gene in 
conjunction with the 252 construct indicates that the 
TetR system can function in transgenic animals and 
that in some cases, for instance in the 10-2 
transgenic animals, the level of regulation can be 
very high (FIGS. 9 A and 9B) . However, in other 
instances the efficiency of repression is not always 
complete, leading to a significant basal level of bGH 
expression. This failure to repress may be due to low 
level expression of tetR. To optimize the expression 
of tetR repressor, a synthetic tetR gene was generated 
which was devoid of splice signals and had optimized 
codon usage for mammalian cells. 



7.1 MATERIALS AND METHODS 

7.1.1. TISSUE SPECIFICITY AND TETRACYCLINE 
INTRODUCTION OF bGH IN L INE 1Q-2 

30 

For all Northern blots 10/ig of whole RNA was 
electrophoreses through a 1% agarose gel containing 3% 
formaldehyde using standard techniques. To detect bGH 
mRNA a random primed, radioactive bGH cDNA probe was 
*c used. All conditions for hybridization and washing of 
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filters were done in accordance with standard 
techniques of mol cular biology. 

5 7* 1.2. EXPRESSION AND ALTERNATIVE PROCESSING 

OF THE tetR TRANSGENE 

A RNase protection probe which extended from the 
Nrul site of tetR 3' to the end of the gene was used. 
This probe includes only tetR coding sequences and 

10 should give a fully protected fragment of 

approximately 400 base pair. When hybridized to 150jxg 
of liver RNA (500,000 cpm of probe in a 30/ul 
hybridization consisting of 80% formamide, 40mM PIPES 
pH 6.4, 400mM NaOAc , and ImM EDTA) , and digested with 

15 RNase one (Pr omega) for 30 minutes at 37° as 

recommended by the manufacturer, a protected fragment 
of approximately 221-260 base pairs is observed, far 
smaller than predicted. 

20 7.1.3. 5' STRUCTURE OF tetR mRNA 

Liver RNA was treated with reverse transcriptase 
and amplified by PCR using the manufacturers 
recommended conditions (Pharmacia) . The RNA was 
amplified using two different pairs of primers. The 

25 first primer pair (TZ-1 and TZ-4) should produce a 619 
base pair product. The second primer pair (TZ-3 and 
TZ-4) should produce a 498 base pair product. The 
sequence of the primers are: 
TZ-1 : 5 • CCGCATATGATCAATTCAAGGCCGAATAAG3 ■ 

30 TZ-3 : 5 • CTTTAGCGACTTGATGCTCTTGATCTTCCA3 ■ 
TZ-4 : 5 • AATTCGCCAGCCATGCCAAAAAAGAAGAGG3 1 

The TZ-4 primer is common to both primer pairs 
and is the 5' primer which encompasses the start codon 
of the tetR mRNA. Primer TZ-1 and TZ-3 are two 

35 different 3 • primers both of which are in the t tR 
coding r gion. When amplified, these primer pairs 
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produce small r than expected products (approx. 215bp 
vs. 619bp for TZ-4 and TZ-1, and approx. 94bp vs. 
498bp for TZ-4 and TZ-3) . The products of this 
5 reaction were cloned and sequenced. The sequence 
revealed the presence of an unexpected intron which 
spanned from near the Xbal site at the start of tetR 
to a splice acceptor just 8 base pairs 5 • of the TZ-3 
primer. 

10 

7.1.4. 345 REPRESSOR CONSTRUCT 

In an embodiment of the invention, any nuclear 
localization signal may be added to a natural or 
synthetic tetR gene to facilitate its expression. For 

15 example, complementary oligonucleotides which encode a 
nuclear localization signal sequence were synthesized 
(Oligos etc.) and added in frame to the tetR coding 
sequences of pSTETR107 at the EcoRl and Xbal 
restriction sites to produce pNTETR. Oligonucleotide 

20 sequences are: 

( GB1 ) 5 ■ AATTCGCCAGCCATGCCAAAAAAGAAGAGGAAGGTAT3 • and 
( GB2 ) 5 • CTAGATACCTTCCTCTTCTTTTTTGGCATGGCTGGC3 1 . 
When annealed these oligonucleotides have a 5 1 EcoRl 
and 3' Xbal compatible overhangs. These 

25 oligonucleotides fuse the amino acid sequence Met Pro 
Lys Lys Lys Arg, Lys Val,to the third amino acid (Arg) 
of wild type tetR. 

Two complementary 51 base pair oligonucleotides 
which start the 5 1 cap site of bGH and extend to the 

30 first exon were synthesized (Oligos etc.). Sequence 
for the oligonucleotides are (5b-l) : 

5 • GATCCCAGGACCCAGTTCACCAGACGACTCAGGGTCCTGTGGACAGCT 
CAG3 1 

and (5b-2) : 

35 5 • AATTCTGAGCTGTCCACAGGACCCTGAGTCGTCTGGTGAACTGGGTCC 
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TGG3 • . When annealed these oligonucleotides have 5 • 
BamHl and 3» EcoRl compatible overhands. The 
oligonucleotides for the 5 1 leader sequence of bGH 
5 were cloned into a BamHl, EcoRl cut plasmid to produce 
p5 ; GH. 

The nuclear localization modified tetR coding 
sequence was isolated by gel purification after 
restriction digestion of pNTETR using EcoRl and Hind 

10 III. This fragment was then inserted into pS^GH at 
the EcoRl and Hind III sites to product pS'GHTR. 

To add the remainder of the bGH genomic sequence 
an intermediate modification of p5 1 GHTR was first 
made. This modification consisted of adding a 

15 Hind III - Pstl linker to the Hind III site of p5 1 GHTR 
to product pGTO. The sequence of the oligonucleotides 
which comprise this linker are: (CC-1) 
5 1 AGCTTCTGCAG3 • and ( CC-2 ) 5 1 AGCTCTGCAGA3 1 . The 
remaining bGH genomic sequences were added in two 

20 steps. First the Pstl Sac2 fragment that begins in 

the first exon of bGH and ends in the third intron was 
excised from pSGH107. Similarly, the insert of pGTO 
which contains the 5' untranslated leader of bGH and 
the nuclear localization modified tetR was excised 

25 using BamHl and Pstl. These two gel purified 

fragments was then cloned into a BamHl Sac2 cut vector 
to produce pGTG. Finally, the remainder of the bGH 
gene from the Sac2 site in the third intron to the end 
of the gene, was added to pGTG by cutting pGTG with 

30 Sac2 and adding the Sac2 fragment from pSGH106 to 
produce pNTETR-GH. 

Plasmid pNTETR-GH was digested with BamHl to 
excise the NTETR-GH gene. The fragment was cloned 
into the BamHl site of pPCK 305 to produce the final 

35 plasmid pPCK— GHNTET . To produce transgenic mice, the 
PEPCK— GHTET gene was excised from the plasmid using 
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Sail and Sacl. This fragment was gel purified and 
coinjected with the PCK-NbGH gene previously described 
to generate transgenic mice. 

5 v " 

7. 1,5. SYNTHETIC tetR COMPONENT SEQUENCES 

The components of the synthetic tetR gene were 

synthesized by Midland Laboratories as four 

overlapping double stranded DNA cassettes. The 

lO sequence of these cassettes are shown in Figure 15. 
Each cassette was blunt cloned into the Hinc2 site of 
pUC19 and sequenced to verify authenticity. The 
resulting plasmids pLTl, pLT2, pLT3 and pLT5 can be 
used as the source material to assemble the entire 

15 synthetic tetR coding sequence since each contains an 
overlapping unique restriction site (bold face) 
through which they can be joined (pLT-1, pLT-2, pLT-3 
and pLT-5 have been deposited with ATCC and have been 
assigned accession numbers , , , and 

20 respectively) . There are many possible ways by which 
these cassettes can be joined. By way of an example, 
the inserts of plasmid pLTl and pLT2 can be excised 
using EcoRl and Nsil. The inserts can then be 
combined by cloning these two fragments into an EcoRl 

25 vector. This procedure will assemble the 5 1 half of 
the gene, using the overlapping Nsil restriction site 
to join the pieces. Similarly, the 3* half of the 
gene can be assembled from pLT3 and pLT5 by cutting 
with EcoRl and Sphl (pLT3) and Sphl and Hind III 

30 (P LT5 ) to release the inserts. These inserts can then 
be joined at the overlapping Sphl site by cloning the 
fragments into an EcoRl, Hind III cut vector. 
Finally, the entire coding region can be put together 
using the overlapping restriction site ApaLl. This 

35 would result in a vector with the synthetic tetR 
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coding sequence, as depicted in Figure 16, cloned into 
a plasmid as an EcoRl Hind III fragment. 

5 7.1.6. COMPOSITIONAL ANALYSIS OF 

WILD TYPE TnlO tetR GENE 

The TnlO tetR coding sequence was analyzed on a 
desktop computer using Mac Vector software. Figure 14 
shows a diagram of the tetR coding region with all of 

10 the plus strand splice doner (D) and splice acceptor 
(A) signal sequences indicated. For reference the 
location of the Xbal restriction is also indicated* 
The first graph depicts the percentage of G and C 
bases in the coding region of tetR. There are several 

15 domains of very low GC content. The bottom graph is 
an analysis of codon bias. The dark line is a 
comparison of the tetR codon usage to a mouse codon 
bias table. Values much lower than 1.0 are indicative 
of sequences which may translate poorly. For 

20 reference, a comparison of tetR to a Tobacco codon 
bias table is included (light line) . In transgenic 
tobacco, the tetR regulation system functions very 
efficiently, suggesting that for this gene, codon bias 
may be an important factor for efficient expression. 

25 

7.1.7. COMPOSITIONAL ANALYSIS OF SYNTHETIC tetR 
Figure 17 depicts the structure of the synthetic 
tetR gene, now devoid of splice donor signal 
sequences, with only a single splice acceptor signal 

30 remaining (A) . This is not the splice acceptor which 
was active in the 345 construct • The percentage of G 
and C bases has been significantly improved, while the 
frequency of CpG base pairs has been kept to a 
minimum. A CpG base pair is frequently the site for 

35 DNA methylation, which can negatively effect the 

expression of a gene. The codon bias of the synthetic 
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tetR gene is vastly improved. The graph depicts the 
results when the synthetic tetR coding sequence is 
compared to the same mouse codon bias table used 
5 previously. 

7.2 RESULTS 

7.2.1. EXPRESSION OF tetR IN CONSTRUCT 345 OFFSPRING 
To improve tetR expression a new repressor 

xo construct was produced. The construct , referred to as 
Construct 345 is depicted in Figure 10. In the 345 
construct the coding region of tetR is augmented with 
a nuclear localization signal sequence to increase the 
nuclear concentration of repressor. The tetR coding 

1S region was inserted into the first exon of the bGH 
gene. The bGH gene then acts as a genomic carrier, 
providing multiple introns, which may improve 
expression, and a strong polyadenylation signal, which 
may improve the processing and stability of the 

20 message. 

The new repressor was coinjected with the bGH 
gene from construct 252. The resulting transgenic 
animals contain the new repressor, and a PEPCK 
regulated bGH gene with the tetR operators located 

25 just 3 1 of the PEPCK TATA-box element. Offspring of 
these animals were screened for bGH induction (FIG. 
11). Of the lines tested only one, line 14, showed 
tetracycline dependent regulation of bGH, and in this 
one case there was still a significant base level of 

30 bGH expression. Northern analysis, performed to 
determine the levels of tetR mRNA expressed in the 
transgenic mice, indicated that the tetR gene was 
still not expressed at a high level. 

To detect tetR mRNA with higher sensitivity the 

35 tetR mRNA was analyzed using RNase protection. This 
technique revealed that the mRNA was shorter then 
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expected (FIG. 12) . Subsequent analysis using reverse 
transcriptase-PCR with primers that amplify the entire 
coding region of tetR confirmed that the mRNA was 
5 significantly shorter then expected (FIG. 13) . 

Sequence analysis cf these RT-PCR products indicated 
that an unexpected splicing event had occurred. This 
splicing process occurred between a splice donor 
signal in the 5' end of the tetR coding region and a 
10 splice acceptor approximately 400 bp 3 9 of the start 
codon. The resulting mRNA is therefore deleted of the 
tetR DNA binding domain and about two third of the 
entire coding region. This mRNA could not possibly 
make a functional repressor. 

15 

7.2.2. OPTIMIZATION OF tetR CONSTRUCT 
A more detailed analysis of the tetR coding 
sequence indicated that the codons used in this gene 
are poorly suited for expression in mammalian cells 

20 (FIG. 14) . Therefore, it appears that the 

inefficiency of the tetR system is the result of two 
processes: (i) aberrant splicing of the RNA to 
produce a nonfunctional message; and (ii) inefficient 
translation which can lead to rapid mRNA turnover. 

25 To circumvent the problems of internal splicing 

and potential problems due to codon bias and G-C 
content, a synthetic tetR gene was designed. The 
components of the synthetic tetR gene were synthesized 
as four overlapping double stranded cassettes. Each 

30 cassette was cloned in pucl9. The resulting plasmids 
designated pLT-1, pLT-2, pLT-3 and pLT-5, as depicted 
in Figure 15, have been deposited with ATCC and 

assigned accession numbers , , , and 

, respectively. The synthetic tetR (syn-tetR) 

35 has been designed to encode exactly the same protein 
product, but is devoid of splice signals and has 
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greatly improved codon usage for mammalian cells. The 
sequence of the of the syn-tetR is indicated in Figure 
16. The predicted analysis for splicing signals, G+C 
5 content, and codon usage are depicted in Figure 17. 

8. DEPOSIT OF MICROORGANISMS 
The following microorganisms have been deposited 
with the American Type Culture Collection, (ATCC) , 
10 Rockville, Maryland and have been assigned the 
following accession numbers: 

Mi cr ooraan i sm Date of Deposit Accession No. 

pLT-1 August 25, 1993 

pLT-2 August 25, 1993 

15 pLT-3 August 25, 1993 

pLT-5 August 25, 1993 

pPCK_NbGH August 25, 1993 

The present invention is not to be limited in 
scope by the microorganisms deposited since the 
20 deposited embodiments are intended as illustrations of 
single aspects of the invention and any microorganisms 
which are functionally equivalent are within the scope 
of the invention. 

The present invention is not to be limited in 
25 scope by the exemplified embodiments which are 

intended as illustrations of single aspects of the 
invention, and any clones, DNA or amino acid sequences 
which are functionally equivalent are within the scope 
of the invention. Indeed, various modifications of 
30 the invention in addition to those described herein 
will become apparent to those skilled in the art from 
the foregoing description and accompanying drawings. 
Such modifications are intended to fall within the 
scope of the appended claims. 
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It is also to be understood that all base pair 
sizes given for nucleotides are approximate and are 
used for purposes of description. 

Various publications are cited herein, which are 
hereby incorporated by reference in their entirety = 



10 



15 



20 



25 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Byrne, Guerard 

(ii) TITLE OF INVENTION: TETRACYCLINE REPRESSOR-MEDIATED BINARY 
REGULATION SYSTEM FOR CONTROL OF GENE EXPRESSION IN 
TRANSGENIC ANIMALS 

(iii) NUMBER OF SEQUENCES: 15 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Pennie & Edmonds 

(B) STREET: 1155 Avenue of the Americas 

(C) CITY: New York 

(D) STATE: New York 

(E) COUNTRY: U.S.A. 

(F) ZIP: 10036-2711 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Pa tent In Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/935,763 

(B) FILING DATE: 26-AUG-1992 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Coruzzi, Laura A. 

(B) REGISTRATION NUMBER: 30,742 

(C) REFERENCE/DOCKET NUMBER: 6794-025 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 212 790-9090 

(B) TELEFAX: 212 869-8864/9741 

(C) TELEX: 66141 PENNIE 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
TTGACACTCT ATCATTGATA GAGTTATTTT ACCACTCCCT ATCAGTGATA GAGAAAAGT 59 
(2) INFORMATION FOR SEQ ID NO: 2: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 
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<ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
GAATTCGATA CTCTATCATT GATAGAGTAT CAAGCTTATC CCTATCAGTG ATAGAGATAC €0 
CGTCGACCTC 70 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
ACTCTATCAT TGATAGAGTT ACTATTTAAA TCCCTATCAG TG AT AG AG A 49 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
GGAATTCGAT ACTCTATCAT TGATAGAGTA TCAAGCTTAT CCCTATCAGT GATAGAGATA 60 
CCGTCGACCT C 71 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 624 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



( ix ) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 1-.624 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

ATG TCT AGA TTA GAT AAA AGT AAA GTG ATT AAC AGC GCA TTA GAG CTG 48 

Met Ser Arg Leu Asp Lys Ser Lys Val lie Asn S jt Ala Leu Glu Leu 
1 5 10 15 

CTT AAT GAG GTC GGA ATC GAA GGT TTA ACA ACC CGT AAA CTC GCC CAG 9 c 
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Leu Aen Glu Val Gly lie Glu Gly Leu Thr Thr Arg Lys Leu Ala Gin 
20 25 30 

AAG CTA GGT GTA GAG CAG CCT ACA TTG TAT TGG CAT GTA . AAA AAT AAG 144 
Lye Leu Gly Val Glu Gin Pro Thr Leu Tyr Trp Hie Val Lye Asn Lys 
35 40 45 

CGG GCT TTG CTC GAG GCC TTA GCC ATT GAG ATG TTA GAT AGG CAC CAT 192 
Arg Ala Leu Leu Asp Ala Leu Ala lie Glu Met Leu Asp Arg His His 
50 55 60 

ACT CAC TTT TGC CCT TTA GAA GGG GAA AGC TGG CAA GAT TTT TTA CGT 240 
Thr His Phe Cys Pro Leu Glu Gly Glu Ser Trp Gin Asp Phe Leu Arg 
65 70 75 80 

AAT AAC GCT AAA AGT TTT AGA TGT GCT TTA CTA AGT CAT CGC GAT GGA 288 
Asn Asn Ala Lys Ser Phe Arg Cys Ala Leu Leu Ser His Arg Asp Gly 
85 90 95 

GCA AAA GTA CAT TTA GGT ACA CGG CCT ACA GAA AAA CAG TAT GAA ACT 336 
Ala Lys Val His Leu Gly Thr Arg Pro Thr Glu Lys Gin Tyr Glu Thr 
100 105 110 

CTC GAA AAT CAA TTA GCC TTT TTA TGC CAA CAA GGT TTT TCA CTA GAG 384 
Leu Glu Asn Gin Leu Ala Phe Leu Cys Gin Gin Gly Phe Ser Leu Glu 
115 120 125 

AAT GCA TTA TAT GCA CTC AGC GCT GTG GGG CAT TTT ACT TTA GGT TGC 432 
Asn Ala Leu Tyr Ala Leu Ser Ala Val Gly His Phe Thr Leu Gly Cys 
130 135 140 

GTA TTG GAA GAT CAA GAG CAT CAA GTC GCT AAA GAA GAA AGG GAA ACA 480 
Val Leu Glu Asp Gin Glu His Gin Val Ala Lys Glu Glu Arg Glu Thr 
145 ISO 155 160 

CCT ACT ACT GAT AGT ATG CCG CCA TTA TTA CGA CAA GCT ATC GAA TTA 528 
Pro Thr Thr Asp Ser Met Pro Pro Leu Leu Arg Gin Ala lie Glu Leu 
165 170 175 

TTT GAT CAC CAA GGT GCA GAG CCA GCC TTC TTA TTC GGC CTT GAA TTG 576 
Phe Asp His Gin Gly Ala Glu Pro Ala Phe Leu Phe Gly Leu Glu Leu 
180 185 190 

ATC ATA TGC GGA TTA GAA AAA CAA CTT AAA TGT GAA AGT GGG TCT TAA 624 
lie lie Cys Gly Leu Glu Lys Gin Leu Lys Cys Glu Ser Gly Ser 
195 200 205 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 207 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Ser Arg Leu Asp Lys Ser Lys Val lie Asn Ser Ala Leu Glu Leu 
1.5 10 15 

Leu Asn Glu Val Gly lie Glu Gly Leu Thr Thr A-c Lys Leu Ala Gin 
20 25 " 30 



Lys Leu Gly Val Glu 
35 



Gin Pro Thr Leu Tyr Trp His Val Lys Asn 
40 45 



Lys 
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Arg Ala Leu Leu Asp Ala Leu Ala He Glu Met: Leu Asp Arg His His 
50 55 60 

Thr Hie Phe Cys Pro Leu Glu Gly Glu Ser Trp Gin Asp Phe Leu Arg 
65 70 75 80 

Aon Asn Ala Lys Ser Phe Arg Cys Ala Leu Leu Ser His Arg Asp Gly 
85 90 95 

Ala Lys Val His Leu Gly Thr Arg Pro Thr Glu Lys Gin Tyr Glu Thr 
100 105 110 

Leu Glu Asn Gin Leu Ala Phe Leu Cys Gin Gin Gly Phe Ser Leu Glu 
115 120 125 

Asn Ala Leu Tyr Ala Leu Ser Ala Val Gly His Phe Thr Leu Gly Cys 
130 135 140 

Val Leu Glu Asp Gin Glu His Gin Val Ala Lys Glu Glu Arg Glu Thr 
145 150 155 160 

Pro Thr Thr Asp Ser Met Pro Pro Leu Leu Arg Gin Ala He Glu Leu 
165 170 175 

Phe Asp His Gin Gly Ala Glu Pro Ala Phe Leu Phe Gly Leu Glu Leu 
180 185 190 

He He Cys Gly Leu Glu Lys Gin Leu Lys Cys Glu Ser Gly Ser 
195 200 205 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS s 

(A) LENGTH: 92 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
CGGCCCTATA AAAAGCGAAG CGCGCGGCGG GCGGGAGTCG CTGCGTTGCC TTCGCCCCGT 60 
GCCCCGCTCC GCGCCGCCTC GCGCCGCCCG CC 92 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 

( ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
AAGAAGTATA TTAGAGCGAG TCTTTCTGCA CACACGATCA CCTTTCCTAT CAACCCCACT 60 
A 61 
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(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
GTATTATGTT TTATGTTACT GTAAAAGATG TAAAGAGAGG CACGTGGTTA AGCTCTCGGG 60 
GTGTGGACTC CACC 74 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CGCCCCAAGC ATAAACCCTG GCGCGCTCGC GGCCCGGCAC TCTTCTGGTC CCCACAGACT 60 
CAGAGAGAAC CCA 73 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
TAGGCAGCAG GCATATGGGA TGGGATATAA AGGGGCTGGA GCACTGAGAG CTGTCAGAGA 60 
TTTCTCCAAC CCAG 74 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 
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ACTCTATCAT TGATAGAGT 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 baae pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
(D j TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION i SEQ ID NO: 13: 
ACTCTATCAA TGATAGAGT 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

<ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
TCCCTATCAG TGATAGAGA 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B ) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
TCTCTATCAC TGATAGGGA 
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Intemational Application No: PCT7 



MICROORGANISMS 

Optional Sheet in connection with the microorganism referred to on page 38, lines 7-23 of the description ' 



A. IDENTIFICATION OF DEPOSIT ' 

Further deposits are identified on an additional sheet 



Name of depositary institution * 
American Type Culture Collection 



Address of depositary institution (including postal code and country) ' 

12301 Perklawn Drive 
Rocfcville. MD 10582 
US 



Date of deposit * August 25. 1993 Accession Number * N/A 



B. ADDITIONAL INDICATIONS * n««e bUnk if oat spoBombk). Thb i 



C. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE ' 



D. SEPARATE FURNISHING OF INDICATIONS ' (taw bhnk if not appiksbk) 



The indications listed below will be submitted to the Irnemstional Bureeu later • fSpectry the Q*ner«J nature of the indications < 
"Accession Number of Deposit*) 



E. L^This sheet was received with the International application when filed (to be checked by the receiving Office) 

horifced Officer) Jj 



(Authorised Officer) 



□ The date of receipt (from the applicant) by the International Bureau * 



was 

Form PCT/RO/134 (January 1981) 



(Authorized Officer) 
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Internetional Application No: PCT/ / 

Form PCTYRO/1 34 (cont.) 

American Type Culture Collection 

12301 Parkiawn urive 
Rocfcviile, MD 10582 



US 



Accession No. 



N/A 



Date of Deposit 
August 25, 1993 



N/A 



August 25, 1993 



N/A 



August 25, 1993 



N/A 



August 25, 1993 



nucleic acid mol cul comprising an animal promoter 
5 element that comprises a tetR operator sequence. 

2. The nucleic acid molecule of claim 1 in 
which the tetR operator sequence is positioned 3 • to a 
TATA-box sequence. 

10 

3. The nucleic acid molecule of claim 1 in 
which the promoter element is the PEPCK promoter. 

4. The nucleic acid molecule of claim 3 in 
15 which the tetR operator sequence has been inserted 

into the Nhel site of the PEPCK promoter element. 

5. The nucleic acid molecule of claim 1, 2, 3 
or 4 in which the promoter element controls the 

20 expression of a gene of interest. 

6. The nucleic acid molecule of claim 5 in 
which the gene of interest is bovine growth hormone. 

25 7. A non-human transgenic animal that carries, 

as a transgene, the nucleic acid molecule of claim 1, 
2, 3 or 4. 

8. A non-human transgenic animal that carries, 
3 q as a transgene, the nucleic acid molecule of claim 5. 

9. A non-human transgenic animal that carries, 
as a transgene, the nucleic acid molecule of claim 6. 
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10. The non-human transgenic animal of claim 7 
that further carries a transgene encoding the tetR 
repressor protein. 

5 

11. The non-human transgenic animal of claim 8 
that further carries a transgene encoding the tetR 
repressor protein. 

10 12. The non-human transgenic animal of claim 9 

that further carries a transgene encoding the tetR 
repressor protein. 

13 . A non-human transgenic animal that carries a 
15 transgene encoding the tetR repressor protein. 

14 . A method of selectively inducing the 
expression of a gene of interest in a non-human 
transgenic animal comprising administering a 

2o tetracycline compound to a non-human transgenic animal 
that carries a first transgene which is a gene of 
interest under the control of a promoter element 
modified to comprise a tetR operator sequence and a 
second transgene encoding the tetR repressor protein. 

25 

15. A non-human transgenic animal that carries 
(i) a first transgene that encodes bovine growth 
hormone and is under the control of PEPCK promoter 
element modified to contain a tetR operator at the 

30 Nhel site; and (ii) a second transgene that encodes 
tetR repressor protein. 

16. The transgenic animal of claim 15 that is a 
mouse . 

35 



nucleic acid molecule comprising an optimized tetR 
gene as depicted in Figure 16. 

19. The non-human transgenic animal of claim 7 
10 that further carries an optimized transgene encoding 

the tetR repressor protein and having a sequence as 
depicted in Figure 16. 

20. The non-human transgenic animal of claim 8 
15 that further carries an optimized transgene encoding 

the tetR repressor protein and having a sequence as 
depicted in Figure 16. 

21. The non-human transgenic animal of claim 9 
20 that further carries an optimized transgene encoding 

the tetR repressor protein and having a sequence as 
depicted in Figure 16. 

22. A non-human transgenic animal that carries 
25 an optimized transgene encoding the tetR repressor 

protein and having a sequence as depicted in Figure 
16. 

23. A method of selectively inducing the 
30 expression of a gene of interest in a non-human 

transgenic animal comprising administering a 
tetracycline compound to a non-human transgenic animal 
that carries a first transgene which is a gene of 
interest under the control of a promoter element 
35 modified to comprise a tetR operator sequence and a 
second optimized transgene encoding the tetR repressor 
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protein and having a sequence as depicted in Figure 
16. 

5 24. A non-human transgenic animal that carries 

(i) a first transgsns that encodes bovine growth 
hormone and is under the control of PEPCK promoter 
element modified to contain a tetR operator at the 
Nhel site; and (ii) a second optimized transgene that 
l0 encodes tetR repressor protein that has a sequence as 
depicted in Figure 16. 

25. The transgenic animal of claim 24 that is a 
mouse • 

15 

26. The transgenic animal of claim 24 that is a 

pig. 



20 



25 



30 



35 
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EcoRl OP1 linker 

ggaattcgat-ACT CTA TCA TTG ATA GAG TAT CAA GCT TAT CCC 

OP2 AccI 

TAT CAG TGA TAG AGA-taccgtcgacctc 



±6. 
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MODIFIED PEPCK PROMOTERS 




261 (Flank TATA) 

( MYA n 
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Si Nuclease Protection: 5' Start Site 

a 252:9-5 252:9-4 252:9-4 
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10 20 30 40 

ATG TCT AGA TTA GAT AAA AGT AAA GTG ATT AAC AGC C-CA TTA GAG 

wr£RLDKSKVlN S_A L^ E> 

" TETRACYCLINE REPRESSOR PROTEIN (TETR) ; OOuwS^pxAn**^ > 

b h b TETR REPRESSOR MRNA [ SPLIT] _b P P P > 

< aZl520ZAlZa^903 TO 1526 OF TRNlOTETR__^1490_a a 



50 



60 
* 



70 



80 



90 



CTG CTT AAT GAG GTC GGA ATC GAA GGT TTA ACA AOC CGT AAA CTC 
LLNEVG IEGLTT RK L> 

TETRACYCLINE REPRESSOR PROTEIN (TETR) ; CODO!^START=:l^ ?• 

~ ~ b b t> TETR REPRESSOR MRNA [SPLIT] _b b P-_- P > 

^ 4 fm V a 1 470_903 TO 1526 OF TRN10TETR_0_a a 1440a 



100 HO 120 130 

GCC CAG AAG CTA GGT GTA GAG CAG CCT ACA TTG TAT TGG CAT GTA 
AOKI*GVEQPTLiWnv> 

TETRACYCLINE REPRESSOR PROTEIN (TETR) ; CODOfL ! START=l_ > 

b b b TETR REPRESSOR MRNA [ SPLIT P P P > 

— Xl430__iZIa_903 TO 1526 OF TRN10TETR__a_1400_a a 



140 



150 



160 



170 



180 



AAA AAT AAG CGG OCT TTG CTC GAC GCC TTA GCC ATT GAG ATC TTA 
1C NKRALLDALAI E M l+> 

TETRACYCLINE REPRESSOR PROTEIN (TETR) ; CODC»^START=l^ > 

_H_ b TETR REPRESSOR MRNA ( SPLIT] _b P. P. -P > 



<1390_A. 



_1380_903 TO 1526 OF TRN10TETR_0_a a 1350J 



190 



200 



210 



220 



GAT AGG CAC CAT ACT CAC TTT TGC CCT TTA GAA GGG GAA AGC TGG 

drhhthfcpleges W> 

TETRACYCLINE REPRESSOR PROTEIN (TETR) ; CODOi^START=l > 

b K b TETR REPRESSOR MRNA [S PLIT ] _p P P. P > 

< ZrTCa.C\ a a 9 03 TO 1526 OF TRN10TETR — a_1310 — Z 



230 



240 



250 



260 



270 



CAA GAT TTT TTA CGT AAT AAC GCT AAA AGT TTT AGA TGT GCT TTA 

odflrnnaksfrcau> 

TETRACYCLINE REPRESSOR PROTEIN (TETR) ; CODON_START=l_ > 

b b b TETR REPRESSOR MRNA [ SPLIT ]_D p > 

<130bta_SZIl290_903 TO 1526 OF TRN1 0TETR_0_a a 1260a 



280 

* 



290 



300 



310 



CTA AGT CAT CGC GAT GGA GCA AAA GTA CAT TTA GGT «CA *-G« CCT 

lshrdgakvhlgtr p> 

TETRACYCLINE REPRESSOR PROTEIN (TETR) ; CODON_START=3 _> 

b b b T ETR REPRESSOR MRNA i SPLIT] _b p b b > 

< ^ 1250 a a 903 TO 1526 OP TRNIOTETR — a_1220 — a a 
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320 



330 



340 



350 



360 



ACA GAA AAA CAG TAT GAA ACT CTC GAA AAT CAA TTA GCC TTT TTA 
TEKQVETLENQLAFL> 

T STPACYCLTg RE?P^S?0' D PROTEIN (TETR) ; CODON_START=l > 

T ETR REPRESSOR MRNA [SPLIT]_b b b b > 

<1210 a, a 1200 903 TO 1526 OF TRN1 0TETR_0_a a 1170a 



370 



380 



390 



400 



TGC CAA CAA GGT TTT TCA CTA GAG AAT GCA TTA TAT GCA CTC AGC 

r G OGFSLENALYA L S> 

TETRACYCLINE REPRESSOR PROTEIN (TETR); CODQN^TART=^ > 

b V- , b T ETR REPRESSOR MRNA [ SPLIT] _b P P p > 

<~ a"Tl60 a a 903 TO 1526 OF TRNl0TETR_a_1130 — a a . 



410 



440 



450 



420 430 

* * * 

GCT GTG GGG CAT TTT ACT TTA GGT TGC GTA TTG GAA GAT CAA GAG 
^V GHFTX-GCVLE^y 1 ^ 
TETRACYCLINE REPRESSOR PROTEIN (TETR) ,- CODON-START^l 

v, t) h T ETR REPRESSOR MRNA [SPLIT] P. *>_--* 

<1120 k_a_iri0_903 TO 1S26 OF TRN10TETR_0_a a 1080« 



460 



470 



480 



490 
* 



CAT CAA GTC GCT AAA GAA GAA AGG GAA ACA CCT ACT ACT GAT AGT 

" -TETRACYCLINE REPRESSOR PROTEIN (TETR) ; CODON_START=1^ > 

w b V) TETR REPRESSOR MRNA [SPLIT] _b b P P > 

<i al070 — a a 903 TO 1526 OF TRN10TETR__a_1040 — a a 



500 



510 



520 



S30 



540 



ATG CCG CCA TTA TTA CGA CAA GCT ATC GAA TTA TTT GAT CAC CAA 

M TETRACYCLINE REPRESSOR PROTEIN (TETR) ; CODON_START=l__ > 

"b V > h T ETR REPRESSOR MRNA [ SPLI T] _b b P— -P > 

<l03o"a a_1020_903 TO 1526 OF TRNlOTETR_0_a a a990a 



550 



560 



570 



580 



GGT GCA GAG CCA GCC TTC TTA TTC GGC CTT GAA TTG ATC ATA TGC 

G TETRACYCLINE REPRESSOR^PROTEIN (TETR) ; CODOtLSTART = 1 > 

b b b TETR REPRESSOR MRNA [SPLIT] _b P P. P > 

< a~980 a a_ 903 TO 1526 OF TRN10TETR — a — 950 — a a 



590 



600 



610 



620 



GGA TTA GAA AAA CAA CTT AAA TGT GAA AGT GGG TCT TAA 
rLEKQLKCESGS > 

TETRACYCLINE REPRESSOR PROTEIN (TETR) ; CODOH : 

V> b ~ETR REPRESSOR MRNA [SI ^IT]_J= b b 

_j~> u ^ _910_a_ 



<940_a. 



903 TO 1526 OF TRN10TETR 910_a a 
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INDUCTION OF BOVINE GROWTH HORMONE 
mRNA BY TETRACYCLINE 



CONSTRUCT 
ANTIBIOTIC 



2S2 



252 



I NT He 9-4 Lc | 4-2 




2-1 | 2-5 




-•-'ifg*r-.' -•«* 



-2SL 
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Tissue Specificity, and Tetracycline Induction 
ofbGHinLine 10-2 
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345 Repressor Construct 



Xba 
I 



Xba Nru TAA 
J L 



PolyA 



PEPCK Nte(R 



bGH 
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Induction ofbGH in Construct 345 Offspring 



FOUNDER 32 33 31 14 19 63 63 44 54 

TETRACYCLINE - + - +| + + I - -+ -+-+-+ - + - + 
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F/Q Ufi £ 



_ Expression and Alternative 
rrocessing of the tetR Transgene 




Protected fraffmgn^ 



406 

367 

263 
224. 



TATA Xh» Nru 
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FIGURE /3 



5* Structure of tetR mRNA 

202 203 210 301 304 



5' primer £ £ £ £ p ' fj £ 

3'primer gg gg gg gg gg 
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Mtnee /y 



Composition Analysis of Wild Type TnlO tetR Gene. 
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Codon Window Size = 25 



Codon Bias File: Mouse codon bias 
Genetic Code; universal 
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Synthetic tetR Component Sequences 

LT-1 

EcoR5 and EcoRl 

GATATCGAATrcATGAGTAGATTGGACAAGAGCAAAGTGATCAATAGTGC 

TCTX3GAGCTOTIGAATGAAGTGQ 

TGGCCCAG^GCTOGGAGTC^^ 

AATAAGAGGGCTCTGCTG GATGCATI GG CGGTAOCA GGC 

Nsil Kpnl 

LT-2 

Kpnl Nsil 
GCTCGGTACCIXKjATCCAT^ 

ACAC 11 UI GCCCACIXKjAAGGCGAG AGTTGGCAGG ACTTCCT 

ATCCTAAGAGTITC^GATGTGCICTGTTGA 

GTGCACCTGGAATTCGAGC 
ApaLl EcoRl 

LT-3 

EcoRl ApaLl 
GCTCGAATTCAAAGTGCACCTGGGTAGAAG 

AGACeCTGGAGAACCAGCTGGCATTTCTGT^ 

GAGAATGCATTGTATGCTCTGaGTGCTGTGGGTCACTTCAC^ 

tctcctggaggaccaggagcaccaggtggccaaggaggagagggagaccx; 
caaccactgacagcatgccccggatccgagc 

Sphl BanHl 

LT-5 

BamHl Sphl 

gctcgg atccacagc atgccccc attg ctg AG ac aggcct atg agctgtt 

TG ACCACCAAGGGGCAG AGCCTGC 1 1 1 1C1G11 I GGCCTGG AGCTCATCA 

TCTGTGGTCTGG AG AAGCAGCTG AAGTGTG AG agtggctcctg aagcttg 
ATATr Hind3/EcoR5 
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Sequence of Synthetic tetR Gene. 



GATATCGAAT 'TCATGAGTAG 
TCAATAGTGC TCTGGAGCTG 
AGGTCTGACT ACCAGAAAGC 
GAGCAGCCAA CATTGTACTG 
CTCTGCTGGA TGCATTGGCC 
CCATACACAC TTCTGCCCAC 
GACTTCCTGA GGAACAATGC 
TGTTGAGCCA CAGAGACGGT 
AAGGCCAACA GAGAAACAGT 
CTGGCATTTC TGTGCCAACA 
CATTGTATGC TCTGAGTGCT 
TTGTGTCCTG GAGGACCAGG 
GAGAGGGAGA CCCCAACCAC 
TGAGACAGGC CATAGAGCTG 
GCCTGCTTTT CTGTTTGGCC 
CTGGAGAAGC AGCTGAAGTG 
TGATATC 



ATTGGACAAG AGCAAAGTGA 

TTGAATGAAG TGGGCATAGA 

TGGCCCAGAA GCTGGGAGTG 

GCATGTGAAG AATAAGAGGG 

ATTGAGATGC TGGACAGACA 

TGGAAGGCGA GAGTTGGCAG 

TAAGAGTTTC AGATGTGCTC 

GCTAAAGTGC ACCTGGGTAC 

ACGAGACCCT GGAGAACCAG 

AGGCTTCAGC CTGGAGAATG 

GTGGGTCACT TCACACTGGG 

AGCACCAGGT GGCCAAGGAG 

TGACAGCATG CCCCCATTGC 

TTTGACCACC AAGGGGCAGA 

TGGAGCTCAT CATCTGTGGT 

TGAGAGTGGC TCCTGAAGCT 
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Compositional analysis of Synthetic tetR 
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